IN THE SPECIFICATION: 

Please replace pages 10-14 ("BRIEF DESCRIPTION OF THE FIGURES"^ with 
the attached substitute pages 10-14, 

Please rewrite the paragraph beginning at page 45, line 2 as follows: 

-Sequence of the pGEX and mcgl (underlined) junction: 
pGEX-1 mcgl (695) 

Sj26. . .GAA TTC GGC ACG AGC CGA CGG (SEQ ID NO: 20) 
Additional amino acids: Glu Phe Gly Thr Ser (SEQ ID NO: 1 1 1).-- 



IN THE SEQUENCE LISTING: 

Please replace the Sequence Listing of Record with the attached substitute 
Sequence Listing, 



REMARKS 

Applicants respectfully submit that the specification has been amended to insert the 
sequence identifiers at pages 10-14 (Brief Description of the Figures) for those sequences 
disclosed in the figures. The specification has also been amended at page 45 to insert the 
sequence identifier for the amino acid sequence disclosed at line 2 of page 45. Applicants submit 
that the foregoing amendment does not introduce new matter. 

Applicants have also amended the Sequence Listing of record. More specifically, 
after SEQ ID NO: 9 on page 74 and before SEQ ID NO: 10 on page 75, there are three sequences 
that are designated as SEQ ID NO: 7, SEQ ID NO: 8, and SEQ ID NO: 9. Applicants 
respectfully submit that these sequences were introduced as a result of a clerical error, and have 
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been deleted from the Sequence Listing as indicated in the attached marked-up copy of the 
Sequence Listing. In addition, Applicants have reformatted the sequences of SEQ ID NO: 1 to 
SEQ ID NO: 45 using the Patentln Version 2. 1 . Furthermore, Applicants have added sequences 
designated as SEQ ID NOS: 46-1 1 1, which are disclosed in the Figures of the application as 
originally filed. 



Applicants submit that the foregoing amendment to the Sequence Listing does not 



introduce new matter. Applicants submit herewith a substitute paper and initial computer 
readable copy of the Sequence Listing, along with a Statement Under 37 C.F.R. §1.82 1(f), stating 
that these copies are identical. A copy of the Notice to Comply is also enclosed. 



Attached hereto is a marked-up version of the changes made to the specification and 



claims by the instant amendment. The attached page is captioned " Version with Markings to 
Show Changes Made. " 

In view of the foregoing amendments and remarks, it is firmly believed that the 
subject application is in condition for allowance, which action is earnestly solicited. 

Respectfully submitted, 



SCULLY, SCOTT, MURPHY & PRESSER 
400 Garden City Plaza 
Garden City, New York 11530 
(516) 742-4343 

FSD/XZ:ab 

Ends.: Substitute copy of pages 10-14 of the Specification 
Version with Markings to Show Changes Made 
Marked-up copy of the Sequence Listing 



Frank S. DiGiglio ^ 
Registration No. 3 1 ,346 
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DEC 1 8 2001 £ 

Serial No: 09/424,458 N»v J& 

VERSION WITH MARKINGS TO SHOW CHANGES MADE 
IN THE SPECIFICATION: 

Please amend the paragraph beginning at page 45, line 2. as follows : 

—Sequence of the pGEX and mcgl (underlined) junction: 
pGEX-1 mcgl (695) 

Sj26. . .GAA TTC GGC ACG AGC CGA CGG (fSEQ ID NO: 20}) 
Additional amino acids: Glu Phe Gly Thr Ser (SEP ID NO: 111). — 
Please amend pages 10-14 ("BRIEF DESCRIPTION OF THE FIGURES") as follows: 

-BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a representation of the nucleotide sequence (f SEQ ID NO: 2}) and 
corresponding amino acid sequence £{SEQ ID NO: 3}) of mcgA. 

Figure 2 is a representation of the alignment of the human MCG4 amino acid sequence 
(amino acids: 1-60. SEP ID NO: 46) (Query) with a translation of a partial murine expressed 
sequence tag (EST) (SEP ID NO: 47) (Subject) . 

Figure 3 is a representation of the alignment of the human MCG4 amino acid sequence 
(amino acids 7-109: SEP ID NO: 48) (Query) with a translation of a partial nematode EST (SEP 
ID NO: 49) (subject) . 

Figure 4 is a diagrammatic representation showing a predicted structure of MCG4 where 
H and C represent histidine and cysteine residues, respectively and X refers to any amino acid 
residue. Zn [represent] represents zinc atoms. 

Figure 5 is a representation of sensitive sequence homology search of related cysteine- 

4 

F:\WORK\536\13 198\amend\13 198.pre 



containing motifs in another Caenorhabditis elegans protein. Query: amino acids 56-100 (SEP 
ID NO: 50). Subject: SEP ID NO: 51. 

Figure 6 is a representation showing that a related cysteine containing motif is present in 
the GATA-binding transcription factor from Sacchromyces pombe. Queries: amino acids 35-58 
(SEP ID NO: 52); amino acids 162-227 (SEP ID NO: 54). Subjects: amino acids 175-198 (SEP 
ID NP: 53): amino acids 441-506 (SEP ID NP: 55). 

Figure 7 is a Northern blot showing expression of mcgA in various cultured human cancer 
cell lines. Lanes 1-5, respectively, represent the hybridization signal from 15 ^g total RNA 
derived from various human cancer cells lines. Lanes 1-5, respectively, contain RNA from H69 
lung carcinoma cells, JAM ovary carcinoma cells, BT20 breast carcinoma cells, HaCat 
transformed keratinocytes, T24 bladder carcinoma cells. 

Figure 8 is a representation of a partial alignment of mcg4 with human ESTs AA074703 
andAA134788. Pueries: nucleotides 446-704 (SEP ID NP: 56); nucleotides 398-452 (SEP ID 
NP: 58): nucleotides 767-810 (SEP ID NP: 60): nucleotides 731-765 (SEP ID NP: 62): 
nucleotides 701-732 (SEP ID NP: 64): nucleotides 498-687 (SEP ID NP: 66): nucleotides 398- 
495 (SEP ID NP: 68): nucleotides 702-761 (SEP ID NP: 70). Subjects: nucleotides 49-307 
(SEP ID NP: 57): nucleotides 2-56 (SEP ID NP: 59); nucleotides 373-416 (SEP ID NP: 61): 
nucleotides 336-370 (SEP ID NP: 63): nucleotides 305-336 (SEP ID NP: 65); nucleotides 103- 
292 (SEP ID NP: 67): nucleotides 2-99 (SEP ID NP: 69): nucleotides 309-368 (SEP ID NP: 1). 

Figure 9 is a representative of the partial nucleotide sequence alignment between a human 
(W32939) (SEP ID NP: 72) and mouse (AA242159) (SEP ID NP: 73) mcg4-like EST in the 
putative 5' UTR of the mcgA cDNA. The putative initiation codon is underlined and the region 
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upstream represents 5' UTR. 

Figure 10 is a representation showing MacVector alignment of MCG4 with forward 
translations of ESTs AA 134788 and AA074703. The nucleotide sequences are shown in Figure 
8. 

Figure 1 1 is a diagrammatic representation of the domains of MCG4 
zinc finger consensus: CX2HX4CX2CX4HX 2 CX,7CX2CXi 8 HX2CXi 8 CX 2 C (SEP ID NO: 74): 
acidic domain consensus: 9/34 amino acids negatively charged, 0/34 positively charged; 
basic domain consensus: 13/55 amino acids positively charged, 0/55 negatively charged; 
leucine zipper domain consensus: LX^LXeRX^LX^L (SEP ID NO: 75); 
alternate "novel" leucine zipper-like motif where leucine would not be aligned along the one 
surface to an alpha helix domain: (aa261) LX 6 LXLX 6 LXLX 6 L (aa286) (SEP ID NP: 76) . 

Figure 12 is a representation showing similarity of MCG7 WITH GEFs of various 
organisms. 

Figure 13(a) is a representation of the nucleotide sequence (fSEQ ID NP: 4}) and 
corresponding amino acid sequence (fSEQ ID NP: 5}) of mcg7. Nucleotides 183-288 are an 
alternative spliced exon (shown in lower case). 

Figure 13(b) is a representation of the partial nucleotide sequence {fSEQ ID NP: 6}} and 
corresponding amino acid sequence ([ SEQ ID NP: 7}) PF mcg7 but without the exon shown in 
Fig. 13(a). Amino acids have been numbered from the first methionine codon (underlined). The 
cDNA molecules of Fig. 13(a) and Fig. 13(b) differ by the inclusion and exclusion of the exon of 
nucleotides 183-288. 

Figure 14 is a representation showing a comparison between MCG7 (SEP ID NP: 7) and 
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a homologue from Caenorhabditis elegans (SEP ID NO: 77) using the BESTFIT algorithm. In 

the figure, the following sequences are underlined: 

EF-Hand=PROSITE DATABASE NO. PD0C00018 

la nematode DVDEEDEVEDIEF ({SEQ ID NO: 10}) 

lb human DVDGDGHISQEEF ({SEQ ID NO: 1 1}) 

nematode DHDRDGFISQEEF ({SEQ ID NO: 1 2}) 

lc human DQNQDGCISREEM ({SEQ ID NO: 13}) 

nematode DVDMDGQISKDEL ({SEQ ID NO: 14}) 

GUANINE NT BINDING REGION = BLOCKS DATABASE NO. BL00720B 

2 human HFVHVAEKLLQLQNFNTLMAWGGLSHSSISRLKETH ({SEQ ID NO: 1 5}) 
nematode KFVHVAKHLRKINNFNTLMSWGGITHSSVARLAKTY ({SEQ ID NO: 16}) 

DaG-PE BINDING DOMAIN = PROSITE DATABASE NO. PDOC00379 

3 human HNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLSVEC 

({SEQ ID NO: 17}) 

nematode HNFHETTFLTPTTCNHCNKLLWGILRQGFKCKDCGLAVHSCCKSNAVAEC 
({SEQ ID NO: 18}) 

Figure 15 is a representation of an alignment of human and a partial (5' UTR and partial 
coding sequence) murine mcgl cDNA (GenBank Acc.No. W71787 and AA237373). The 
putative initiation codon is underlined. The murine sequence (SEP ID NO: 78) represents a 
composite of 2 partial cDNA sequences from the EST database (accession numbers W71787 and 
AA237373). Nucleotide differences between human and murine sequences are shown in lower 
case lettering and identical residues are indicated with asterisks. 
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Figure 16 is a representation of further 5' nucleotide (SEP ID NO: 79) and corresponding 
amino acid sequence (SEP ID NO: 80) human mcgl. Nucleotide positions 1-321 were derived 
from GenBank Acc. No. AC000134 and nucleotides 322 onwards from Figure 13(a). Two in- 
frame initiation codons are underlined. Asterisks denote in-frame stop codons. 

Figure 17 is a graphical representation of a GDP release assay. □Experiment #1 (mean of 
duplicate). OExperiment #2 (mean of duplicates). The exchange reaction contained 36pmols of 
GST-MCG (N-terminally truncated; encoded by Construct B in Fig. 18) and 1.6-12.8 pmols of 
recombinant GST-N-Ras.GDP. Reaction time 6 min. 
Estimated reaction constants: 
K m = 2.1 jiM, V™^ 37 pMol/6min/36pMol [Expt.#l] 
K m = l.5\M 9 V max = 30.3pMol/6min/36pMol[Expt.#2] 

Figure 18 depicts various recombinant plasmids containing partial or full-length mcg7. 

Figure 19 is a representation of the nucleotide sequence (£SEQ ID NO: 8}) and 
corresponding amino acid sequence ({SEQ ID NO: 9}) of mcgl 8. 

Figure 20 is representation showing that MCG18 (amino acids 35-96, SEP ID NO: 811 
(Query) has partial homology to E. coli DnaJ (amino acids 6-65, SEP ID NO: 82) (Subject) . 

Figure 21 is a representation showing that MCG18 has homotology to two Caenorhabitis 
elegans proteins. Pueries: amino acids 28-64 (SEP ID NP: 83): amino acids 71-102 (SEP ID 
NP: 85); amino acids 162-203 (SEP ID NP: 87); amino acids 35-75 (SEP ID NP: 89): amino 
acids 75-96 (SEP ID NP:91); amino acids 141-184 (SEP ID NP: 93). Subjects: amino acids 22- 
58 (SEP ID NP:84); amino acids 64-95 (SEP ID NP: 86); amino acids 158-199 (SEP ID NP: 
88); amino acids 19-50 (SEP ID NP: 90); amino acids 71-92 (SEP ID NP: 92); amino acids 
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IN THE SEQUENCE LISTING: 

Please amend the Sequence Listing of Record as indicated in the attached marked-up copy 
of the amended sequence listing. 
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WO 98/53061 



PCT/AU98/00380 



I SEQUENCE LISTING 



57 



% (J) GENERAL INFORMATION: 




A' (i\APPLICANT: (OTHER THAN US): The Council of The Queensland Institute of 
■ ^ Medical Research 

(US ONLY): HAYWARD Nicholas, SILINS Gintdk, GRJMMOND Sean 
GARTSUDkMichael and HANCOCK. John 



(ii) TITLE OMNVENTION:A NOVEL GENE AND USES \ 

(iii) NUMBER OFNSEQUENCES: 45 

(iv) CORRESPONDENCE ADDRESS : 

(A) ADDRESSEE: DA VIES COLLISON CA^ 

(B) STREET: 1 UTTlX COLLINS STREET 

(C) CITY: ME LB 01 

(D) STATE: VICTORIA 

(E) COUNTRY: AUSTRAL! 

(F) ZIP: 3000 

(v) COMPUTER READABLE FORM:1 

(A) MEDIUM TYPE: Floppy disk. 

(B) COMPUTER: IBM PC comoaiible 

(C) OPERATING SYSTEM: P|Z-DOS/MS^DOS 

(D) SOFTWARE: Patentln Reflease #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: PCT INTERNATIONAL 

(B) FILING DATE: 22&1AY- 1998 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P06973 

(B) FILING DATE: 23-MAY-1997 

(C) CLASSIFICATION : 

(vii) PRIORAPPLICATION DATA: 

(A) APPLICATION NUMBER: P06974 

(B) filing date: 23-MAY-1997 

(C/CLASSDFICATION: 

(vijjj PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: P06972 

(B) FILING DATE: 23 MAY 1997 



EREFOR 
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(C) CLAsaincATJOW 

(vii) PRiQR APPLICATION DATA: 

(A) APPLICATION NUMBER: PP1459 

(B) FEJNG^ATE: 22-JAN-1998 

(C) CI^SSEFICATION: 

(vii) PRIOR APPLICATION^ATA: 

(A) APPLICATION NUMBER P£<f460 

(B) FILING DATE: 22-JAN- 195 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION6ATA: 

(A) APPLICATION NUMBER: PP1458 

(B) FILING DATE/22-J AN- 1998 

(C) CLASSOTC/TION: 

(viii) ATTORNEY/AGENT INFORMATION: 
(A) NAME: HUGHES. DR E JOHN L 
(C) REFERENCE/DOCKET NUMBER: EJH/AF 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +61 3 9254 2777 

(B) TELEFAX: +61 3 9254 2770 
(0)TELLX AA 31707 ; 
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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

Cys Xaa Xaa Cys Xaa Gly Xaa Gly 
5 



(2) INFORMATION FOR SEQ ID NO: 2: 

{i) SEQUENCE CHARACTERISTICS: 

<A) LENGTH : 1242 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear ' 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 30.. 959 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 



TCAGTAAACA CAGAGACTGG GGATCGATC ATG GGG CTT TGT AAG TGC CCC AAG 53 

Met Gly Leu Cys Lys Cys Pro Lys 

AGA AAG GTG ACC AAC CTG TTC TGC TTC GAA CAT CGG GTC AAC GTC TGC 
Arg Lys Val Thr Asn Leu Phe Cys Phe Glu His Arg Val Asn Val Cys 
10 15 20 

GAG CAC TGC CTG GTA GCC AAT CAC GCC AAG TGC ATC GTC CAG TCC TAC 
Glu His Cys Leu Val Ala Ash His Ala Lys Cys lie Val Gin Ser Tyr 
25 30 35 Jo 

CTG CAA TGG CTC CAA GAT AGC GAC TAC AAC CCC AAT TGC CGC CTG TGC 
Leu Gin Trp Leu Gin Asp Ser Asp Tyr Asn Pro Asn Cys Arg Leu Cvs 
45 50 55 

AAC ATA CCC CTG GCC AGC CGA GAG ACG ACC CGC CTT GTC TGC TAT GAT 
Asn lie Pro Leu Ala Ser Arg Glu Thr Thr Arg Leu Val Cys Tyr Asp 
60 .65 70 

CTC TTT CAC TGG GCC TGC CTC AAT GAA CGT GCT GCC CAG CTA CCC CGA 
Leu Phe His Trp Ala Cys Leu Asn Glu Arg Ala Ala Gin Leu Pro Aro 
75 . 80 85 

AAC ACG GCA CCT GCC GGC . TAT CAG TGC CCC AGC TGC AAT GGC CCC ATC 
Asn Thr Ala Pro Ala Gly Tyr Gin Cys Pro Ser Cys Asn Gly Pro He 
90 95 ioo 

TTC CCC CCA ACC AAC CTG GCT GGC CCC GTG GCC TCC GCA CTG AGA GAG 
Phe Pro Pro Thr Asn Leu Ala Gly Pro Val Ala Ser Ala Leu Arg Glu 
105 HO 115 . ^ 120 



101 



149 



197 



245 



293 



341 



389 
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AAG CTG GCC ACA GTC AAC TGG GCC CGG GCA GGA CTG GGC CTC CCT CTG 43 7 

Lys Leu Ala Thr Val Asn Trp Ala Arg Ala Gly Leu Gly Leu Pro Leu 
125 130 135 

ATC GAT GAG GTG GTG AGC CCA GAG CCC GAG CCC CTC AAC ACG TCT GAC 4 85 

He Asp Glu Val Val Ser Pro Glu Pro Glu Pro Leu Asn Thr Ser Asp 
140 145 150 

TTC TCT GAC TGG TCT AGT TTT AAT GCC AGC AGT ACC CCT GGA CCA GAG 533 
Phe Ser Asp Trp Ser Ser Phe Asn Ala Ser Ser Thr Pro Gly Pro Glu 
155 160 165 

GAG GTA GAC AGC GCC TCT GCT GCC CCA GCC TTC TAC AGC CGA GCC CCC 581 
Glu Val Asp Ser Ala Ser Ala Ala Pro Ala Phe Tyr Ser Arg Ala Pro 
170 175 ISO 



CGG CCC CCA GCT TCC CCA GGC CGG CCC GAG CAG CAC ACA GTG ATC CAC 
Arg Pro Pro Ala Ser Pro Gly Arg Pro Glu Gin His Thr Val He His 
185 190 195 200 



GCG GGG CTG CTG CTA CTC TTG GGA CTG CTG GGC TTC CTG GCC CTC CTT 
Ala Gly Leu Leu Leu Leu Leu Gly Leu Leu Gly Phe Leu Ala Leu Leu 
265 270 275 2 80 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH; 310 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



629 



ATG GGC AAT CCT GAG CCC TTG ACT CAC GCC CCT AGG AAG GTG TAT GAT 677 
Met Gly Asn Pro Glu Pro Leu Thr His Ala Pro Arg Lys Val Tyr Asp 
205 210 215 

ACG CGG GAT GAT GAC CGG ACA CCA GGC CTC CAT GGA GAC TGT GAC GAT 725 
Thr Arg Asp Asp Asp Arg Thr Pro Gly Leu His Gly Asp Cys Asp Asp 
220 225 230 

GAC AAG TAC CGA CGT CGG CCG GCC TTG GGT TGG CTG GCC CGG CTG CTA 773 
Asp Lys Tyr Arg Arg Arg Pro Ala Leu Gly Trp Leu Ala Arg Leu Leu 
235 240 245 

AGG AGC CGG GCT GGG TCT' CGG AAG CGG CCG CTG ACC CTG CTC CAG CGG 821 
Arg Ser Arg Ala Gly Ser Arg Lys Arg Pro Leu Thr Leu Leu Gin Ara 
250 255 260 



869 



GCC CTC ATG TCT CGC CTA GGC CGG GCC GCA GCT GAC AGC GAT CCC AAC 917 
Ala Leu Met Ser Arg Leu Gly Arg Ala Ala Ala Asp Ser Asp Pro Asn 
285 290 295 

CTG GAC CCA CTC ATG AAC CCT CAC ATC CGC GTG GGC CCC TCC TGA 9 62 
Leu Asp Pro Leu Met Asn Pro His lie Arg Val Gly Pro Ser 
300 305 310 



GCCCCCTTGC 


TTGTGGCTAG 


GCCAGCCTAG 


GATGTGGGTT 


CTGTGGAGGA 


GAGGCGGGGT 


1022 


AATGGGGAGG 


CTGAGGGCAC 


CTCTTCACTG 


CCCCTCTCCC 


TCAAGCCTAA 


G ACAC T AAG A . 


1082 


CCCCAGACCC 


AAAGCCAAGT 


CCACCAGAGT 


GGCTCGCAGG 


CCAGGCCTGG 


AGTCCCCGTG 


1142 


GGTCAAGCAT 


TTGTCTTGAC 


TTGCTTTCTC 


CCGGGTCTCC 


AGCCTCCGAC 


CCCTCGCCCC 


1202 


ATGAAGGAGC 


TGGCAGGTGG 


AAATAAACAA 


CAACTTTATT 






1242 



(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Met Gly Leu Cys Lys Cys Pro Lys Arg Lys Val Thr Asn Leu Phe Cys 
1 '5 10 is 

Phe Glu His Arg Val Asn Val Cys Glu His Cys Leu Val Ala Asn His 
20 25 30 

Ala Lys Cys lie Val Gin Ser Tyr Leu Gin Trp Leu Gin Asp Ser Asp 
35 40 45 

Tyr Asn Pro Asn Cys Arg Leu Cys Asn lie Pro Leu Ala Ser Arg Glu 
.50 55 60 

Thr Thr Arg Leu Val Cys Tyr Asp Leu Phe His Trp Ala Cys Leu Asn 
65 70 75 80 

Glu Arg Ala Ala Gin Leu Pro Arg Asn Thr Ala Pro Ala Gly Tyr Gin 
85 90 95 

Cys Pro Ser Cys Asn Gly Pro lie Phe Pro Pro Thr Asn Leu Ala Gly 
100 105 HO 

Pro Val Ala Ser Ala Leu Arg Glu Lys Leu Ala Thr Val Asn Trp Ala 
115 120 125 

Arg Ala Gly Leu Gly Leu Pro Leu lie Asp Glu Val Val Ser Pro Glu 
130 135 140 

Pro Glu Pro Leu Asn Thr Ser Asp Phe Ser Asp Trp Ser Ser Phe Asn 
145 150 155 160 

Ala Ser Ser Thr Pro Gly Pro Glu Glu Val Asp Ser Ala Ser Ala Ala 
165 170 175 

Pro Ala Phe Tyr Ser Arg Ala Pro Arg Pro Pro Ala Ser Pro Gly Arg 
180 185 190 

Pro Glu Gin His Thr Val He His Met Gly Asn Pro Glu Pro Leu Thr 
195 200 205 

His Ala Pro Arg Lys Val Tyr Asp Thr Arg Asp Asp Asp Arg Thr Pro 
210 215 220 

Gly Leu His Gly Asp Cys Asp Asp Asp Lys Tyr Arg Arg Arg Pro Ala 
225 230 235 240 

Leu Gly Trp Leu Ala Arg Leu Leu Arg Ser Arg Ala Gly Ser Arg Lvs 
245 250 255 

Arg Pro Leu Thr Leu Leu Gin Arg Ala Gly Leu Leu Leu Leu Leu Glv 
260 265 270 

Leu Leu Gly Phe Leu Ala Leu Leu Ala Leu Met Ser Arg Leu Glv Arc 
275 280 285 

Ala Ala Ala Asp Ser Asp Pro Asn Leu Asp Pro Leu Met Asn Pro His 
290 295 300 

He Arg Val Gly Pro Ser 
305 310 



(2) INFORMATION FOR SEQ ID NO : 4 : 

( i ) SEQUENCE CHARACTERISTICS : 



I 
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(A) LENGTH: 2415 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME /KEY ; CDS ' 

(B) LOCATION: 3 . .-2*8* 2l 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

CG ATT TCA TTC CTC GCT CCC CAC AGG TCC CTC TCC CCA AAA TAT TCC 47 
lie Ser Phe Leu Ala Pro His Arg Ser Leu Ser Pro Lys Tyr Ser 
15 10 15 

CAT CTT GTC CTA GCC CAT CCC CCA GAC TAT CTC AAG GAC CAG CTG TCC 95 
His Leu Val Leu Ala His Pro Pro Asp Tyr Leu Lys Asp Gin Leu Ser 
20 25 30 

CCA CGC CCC CGA CCT CCA CTA GGC CTG TGC CAC CCG CTG CCT GCA GGA 143 
Pro Arg Pro Arg Pro Pro Leu Gly Leu Cys His Pro Leu Pro Ala Glv 
35 40 45 

AGA CGC CCG GTC CCG GGC CGG GTT AGC CCC ATG GGA ACG CAG CGC CTG 191 
Arg Arg Pro Val Pro Gly Arg Val Ser Pro Met Gly Thr Gin Arq Leu 
50 55 60 

TGT GGC CGC GGG ACT CAA GGC TGG CCT GGC TCA AGT GAA CAG CAC GTC 23 9 

Cys Gly Arg Gly Thr Gin Gly Trp Pro Gly Ser Ser Glu Gin His Val 
65 70 75 

CAG GAG GCG ACC TCG TCC GCG GGT TTG CAT TCT GGG GTG GAC GAG CTG 287 
Gin Glu Ala Thr Ser Ser Ala Gly Leu His Ser Gly Val Asp Glu Leu 
80 85 90 95 

GGG GTT CGG TCC GAG CCC GGT GGG AGG CTC CCG GAG CGC AGC CTG GGC 3 35 

Gly Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu Arg Ser Leu Gly 
100 105 no 

CCA GCC CAC CCC GCG CCG . GCG GCC ATG GCA GGC ACC CTG GAC CTG GAC 3 83 

Pro Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr Leu Asp Leu Asd 
115 120 125 

AAG GGC TGC ACG GTG GAG GAG CTG CTC CGC GGG TGC ATC GAA GCC TTC 431 
Lys Gly Cys Thr Val Glu Glu Leu Leu Arg Gly Cys He Glu Ala Phe 
130 135 140 

GAT GAC TCC GGG AAG GTG CGG GAC CCG CAG CTG GTG CGC ATG TTC CTC 479 
Asp Asp Ser Gly Lys Val Arg Asp Pro Gin Leu Val Arg Met Phe Leu 
145 150 155 

ATG ATG CAC CCC TGG TAC ATC CCC TCC TCT CAG CTG GCG GCC AAG CTG 527 
Met Met His Pro Trp Tyr He Pro Ser Ser Gin Leu Ala Ala Lys Leu 
16 ° 165 170 175 

CTC CAC ATC TAC CAA CAA TCC CGG AAG GAC AAC TCC AAT TCC CTG CAG 57 5 

Leu His He Tyr Gin Gin Ser Arg Lys Asp Asn Ser Asn Ser Leu Gin 
190 185 190 

GTG AAA ACG TGC CAC CTG GTC AGG TAC TGG ATC TCC GCC TTC CCA GCG 62 3 

Val Lys Thr Cys His Leu Val Arg Tyr Trp He Ser Ala Phe Pro Ala 
195 200 205 
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GAG TTT GAC TTG AAC CCG GAG TTG GCT GAG CAG ATC AAG GAG CTG AAG 671 
Glu Phe Asp Leu Asn Pro Glu Leu Ala Glu Gin lie Lys Glu Leu Lys 
210 215 220 

GCT CTG CTA GAC CAA GAA GGG AAC CGA CGG CAC AGC AGC CTA ATC GAC 719 
Ala Leu Leu Asp Gin Glu Gly Asn Arg Arg His Ser Ser Leu lie Asp 
225 230 235 

ATA GAC AGC GTC CCT ACC TAC AAG TGG AAG CGG CAG GTG ACT CAG CGG 767 
He Asp Ser Val Pro Thr Tyr Lys Trp Lys Arg Gin Val Thr Gin Arg 
240 245 250 255 

AAC CCT GTG GGA CAG AAA AAG CGC AAG ATG TCC CTG TTG TTT GAC CAC 815 
Asn Pro Val Gly Gin Lys Lys Arg Lys Met Ser Leu Leu Phe Asp His 
. 260 265 270 

CTG GAG CCC ATG GAG CTG GCG GAG CAT CTC ACC TAC TTG GAG TAT CGC 863 
Leu Glu Pro Met Glu Leu Ala Glu His Leu Thr Tyr Leu Glu Tyr Arg 
275 280 285 

TCC TTC TGC AAG ATC CTG TTT CAG GAC TAT CAC AGT TTC GTG ACT CAT 911 
Ser Phe Cys Lys He Leu Phe Gin Asp Tyr His Ser Phe Val Thr His 
290 295 300 

GGC TGC ACT GTG GAC AAC CCC GTC CTG GAG CGG TTC ATC TCC CTC TTC ' 959 
Gly Cys Thr Val Asp Asn Pro Val Leu Glu Arg Phe He Ser Leu Phe 
305 310 315 

AAC AGC GTC TCA CAG TGG GTG CAG CTC ATG ATC CTC AGC AAA CCC ACA 1007 
Asn Ser Val Ser Gin Trp Val Gin Leu Met He Leu Ser Lys Pro Thr 
320 325 330 335 

GCC CCG CAG CGG GCC CTG GTC ATC ACA CAC TTT GTC CAC GTG GCG GAG 1055 
Ala Pro Gin Arg Ala Leu Val He Thr His Phe Val His Val Ala Glu 
340 345 350 

AAG CTG CTA CAG CTG CAG AAC TTC AAC ACG CTG ATG GCA GTG GTC GGG 1103 
Lys Leu Leu Gin Leu Gin Asn Phe Asn Thr Leu Met Ala Val Val Gly 
355 360 365 

GGC CTG AGC CAC AGC TCC ATC TCC CGC CTC AAG GAG ACC CAC AGC CAC 1151 
Gly Leu Ser His Ser Ser He Ser Arg Leu Lys Glu Thr His Ser His 
370 375 380 

GTT AGC CCT GAG ACC ATC AAG CTC TGG GAG GGT CTC ACG GAA CTA GTG 1199 
Val Ser Pro Glu Thr He Lys Leu Trp Glu Gly Leu Thr Glu Leu Val 
385 390 395 

ACG GCG ACA GGC AAC TAT GGC AAC TAC CGG CGT CGG CTG GCA GCC TGT 1247 
Thr Ala Thr Gly Asn Tyr Gly Asn Tyr Arg Arg Arg Leu Ala Ala Cys 
400 405 410 415 

GTG GGC TTC CGC TTC CCG ATC CTG GGT GTG CAC CTC AAG GAC CTG GTG 129 5 

Val Gly Phe Arg Phe Pro He Leu Gly Val His Leu Lys Asp Leu Val 
420 425 430 

GCC CTG CAG CTG GCA CTG CCT GAC TGG CTG GAC CCA GCC CGG ACC CGG 1343 
Ala Leu Gin Leu Ala Leu Pro Asp Trp Leu Asp Pro Ala Arg Thr Arg 
435 440 445 

CTC AAC GGG GCC AAG ATG AAG CAG CTC TTT AGC ATC CTG GAG GAG CTG 13 91 

Leu Asn Gly Ala Lys Met Lys Gin Leu Phe Ser lie Leu Glu Glu Leu 
450 455 460 

GCC ATG GTG ACC AGC CTG CGG CCA CCA GTA CAG GCC AAC CCC GAC CTG 14 39 

Ala Met Val Thr Ser Leu Arg Pro Pro Val Gin Ala Asn Pro Asp Leu 
465 ' 470 475 
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CTG AGC CTG CTC ACG GTG TCT CTG GAT CAG TAT CAG ACG GAG GAT GAG 14 87 

Leu Ser Leu Leu Thr Val Ser Leu Asp Gin Tyr Gin Thr Glu Asp Glu 
480 485 490 495 

CTG TAC CAG CTG TCC CTG CAG CGG GAG CCG CGC TCC AAG TCC TCG CCA 153 5 

Leu Tyr Gin Leu Ser Leu Gin Arg Glu Pro Arg Ser Lys Ser Ser Pro 
500 505 510 

ACC AGC CCC ACG AGT TGC ACC CCA CCA CCC CGG CCC CCG GTA CTG GAG 1583 
Thr Ser Pro Thr Ser Cys Thr Pro Pro Pro Arg Pro Pro Val Leu Glu 
515 520 525 

GAG TGG ACC TCG GCT GCC AAA CCC AAG CTG GAT CAG GCC CTC GTG GTG 1631 
Glu Trp Thr Ser Ala Ala Lys Pro Lys Leu Asp Gin Ala Leu Val Val 
530 535 540 

GAG CAC ATC GAG AAG ATG GTG GAG TCT GTG TTC CGG AAC TTT GAC GTC 1679 
Glu His He Glu Lys Met Val Glu Ser Val Phe Arg Asn Phe Asp Val 
545 550 555 

GAT GGG GAT GGC CAC ATC TCA CAG GAA GAA TTC CAG ATC ATC CGT GGG 172 7 

Asp Gly Asp Gly His He Ser Gin Glu Glu Phe Gin He He Arg Glv 
560 565 570 575 

AAC TTC CCT TAC CTC AGC GCC TTT GGG GAC CTC GAC CAG AAC CAG GAT 1775 
Asn Phe Pro Tyr Leu Ser Ala Phe Gly Asp Leu Asp Gin Asn Gin Asp 
580 5B5 590 

GGC TGC ATC AGC AGG GAG GAG ATG GTT TCC TAT TTC CTG CGC TCC AGC 182 3 

Gly Cys He Ser Arg Glu Glu Met Val Ser Tyr Phe Leu Arg Ser Ser 
595 600 605 

TCT GTG TTG GGG GGG CGC ATG GGC TTC GTA CAC AAC TTC CAG GAG AGC 1871 
Ser Val Leu Gly Gly Arg Met Gly Phe Val His Asn. Phe Gin Glu Ser > 
610 615 620 

AAC TCC TTG CGC CCC GTC GCC TGC CGC CAC TGC AAA GCC CTG ATC CTG 1919 
Asn Ser Leu Arg Pro Val Ala Cys Arg His Cys Lys Ala Leu He Leu 
625 630 635 

GGC ATC TAC AAG CAG GGC CTC AAA TGC CGA GCC TGT GGA GTG AAC TGC 
Gly He Tyr Lys Gin Gly Leu Lys Cys Arg Ala Cys Gly Val Asn Cvs 
640 645 650 655 

CAC AAG CAG TGC AAG GAT CGC CTG TCA GTT GAG TGT CGG CGC AGG GCC 2 015 

His Lys Gin Cys Lys Asp Arg Leu Ser Val Glu Cys Arg Arg Arg Ala 
660 665 670 

CAG AGT GTG AGC CTG GAG GGG TCT GCA CCC TCA CCC TCA CCC ATG CAC 
Gin Ser Val Ser Leu Glu Gly Ser Ala Pro Ser Pro Ser Pro Met His 
675 680 685 

AGC CAC CAT CAC CGC GCC TTC AGC TTC TCT CTG CCC CGC CCT GGC AGG 
Ser His His His Arg Ala Phe Ser Phe Ser Leu Pro Arg Pro Glv Ara 
€90 695 700 

CGA GGC TCC AGG CCT CCA GAG ATC CGT GAG GAG GAG GTA CAG ACG GTG 
Arg Gly Ser Arg Pro Pro Glu lie Arg Glu Glu Glu Val Gin Thr Val 
705 710 715 

GAG GAT GGG GTG TTT GAC ATC CAC TTG TA ATAGATGCTG TGGTTGGATC 
Glu Asp Gly Val Phe Asp He His Leu 
720 725 

AAGGACTCAT TCCTGCCTTG GAGAAAATAC TTCAACCAGA GCAGGGAGCC TGGGGGTGTC 
GGGGCAGGAG GCTGGGGATG GGGGTGGGAT ATGAGGGTGG CATGCAGCTG AGGGCAGGGC 



1967 



2063 

2111 

2159 

2208 

2268 
2328 
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CAGGGCTGGT GTCCCTAAGG TTGTACAGAC TCTTGTGAAT ATTTGTATTT TCCAGATGGA 2388 
ATAAAAAGGC CCGTGTAATT AACCTTC 2415 

( 2 ) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

lie Ser Phe Leu Ala Pro His Arg Ser Leu Ser Pro Lys Tyr Ser His 
1. 5 10 15 

Leu Val Leu Ala His Pro Pro Asp Tyr Leu Lys Asp Gin Leu Ser Pro 
20 25 30 

Arg Pro Arg Pro Pro Leu Gly Leu Cys His Pro Leu Pro Ala Gly Arg 
35 40 45 

Arg Pro Val Pro Gly Arg Val Ser Pro Met Gly Thr Gin Arg Leu Cys 
50 55 60 

Gly Arg Gly Thr Gin Gly Trp Pro Gly Ser Ser Glu Gin His Val Gin 
65 70 75 80 

Glu Ala Thr Ser Ser Ala Gly Leu His Ser Gly Val Asp Glu Leu Gly 
85 90 .95 

Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Glu Arg Ser Leu Gly Pro 
100 105 110 

Ala His Pro Ala Pro Ala Ala Met Ala Gly Thr Leu Asp Leu Asp Lys 
115 120 125 

Gly Cys Thr Val Glu Glu Leu Leu Arg Gly Cys lie Glu Ala Phe Asp 
130 135 140 

Asp Ser Gly Lys Val Arg Asp Pro Gin Leu Val Arg Met Phe Leu Met 
145 150 155 160 

Met His Pro Trp Tyr lie Pro Ser Ser Gin Leu Ala Ala Lys Leu Leu 
165 170 175 

His lie Tyr Gin Gin Ser Arg Lys Asp Asn Ser Asn Ser Leu Gin Val 
180 185 190 

Lys Thr Cys His Leu Val Arg Tyr Trp lie Ser Ala Phe Pro Ala Glu 
195 200 205 

Phe Asp Leu Asn Pro Glu Leu Ala Glu Gin lie Lys Glu Leu Lys Ala 
210 215 220 

Leu Leu Asp Gin Glu Gly Asn Arg Arg His Ser Ser Leu lie Asp lie 
225 230 235 240 

Asp Ser Val Pro Thr Tyr Lys Trp Lys. Arg Gin Val Thr Gin Arg Asn 
245 250 255 

Pro Val Gly Gin Lys Lys Arg Lys Met Ser Leu Leu Phe Asp His Leu 
260 265 270 
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Glu Pro Met Glu Leu Ala Glu His Leu Thr Tyr Leu Glu* Tyr Arg Ser 
275 280 285 

Phe Cys Lys lie Leu Phe Gin Asp Tyr His Ser Phe Val Thr His Gly 
290 295 300 

Cys Thr Val Asp Asn Pro Val Leu Glu Arg Phe lie Ser Leu Phe Asn 
305 310 315 320 

Ser Val Ser Gin Trp Val Gin Leu Met He Leu Ser Lys Pro Thr Ala 
325 330 335 

Pro Gin Arg Ala Leu Val He Thr His Phe Val His Val Ala Glu Lys 
340 345 350 

Leu Leu Gin Leu Gin Asn Phe Asn Thr Leu Met Ala Val Val Gly Gly 
355 360 365 

Leu Ser His Ser Ser He Ser Arg Leu Lys Glu Thr His Ser His Val 
370 375 380 

Ser, Pro Glu Thr He Lys Leu Trp Glu Gly Leu Thr Glu Leu Val Thr 
385 390 395 400 

Ala Thr Gly Asn Tyr Gly Asn Tyr Arg Arg Arg Leu Ala Ala Cys Val 
405 410 415 

Gly Phe Arg Phe Pro He Leu Gly Val His Leu Lys Asp Leu Val Ala 
420 425 430 

Leu Gin Leu Ala Leu Pro Asp Trp Leu Asp Pro Ala Arg Thr Arg Leu 
435 440 445 

Asn Gly Ala Lys Met Lys Gin Leu Phe Ser He Leu Glu Glu Leu Ala 
450 .455 460 

Met Val Thr Ser Leu Arg Pro Pro Val Gin Ala Asn Pro Asp Leu Leu 
465 470 475 480 

Ser Leu Leu Thr Val Ser Leu Asp Gin Tyr Gin Thr Glu Asp Glu Leu 
485 490 495 

Tyr Gin Leu Ser Leu Gin Arg Glu Pro Arg Ser Lys Ser Ser Pro Thr 
500 505 510 

Ser Pro Thr Ser Cys Thr Pro Pro Pro Arg Pro Pro Val Leu Glu Glu 
515 520 525 

Trp Thr Ser Ala Ala Lys Pro Lys Leu Asp Gin Ala Leu Val Val Glu 
530 535 540 

His He Glu Lys Met Val Glu Ser Val Phe Arg Asn Phe Asp Val Asd 
545 550 555. 560 

Gly Asp Gly His He Ser Gin Glu Glu Phe Gin lie He Arg Gly Asn 
565 570 575 

Phe Pro Tyr Leu Ser Ala Phe Gly Asp Leu Asp Gin Asn Gin Asp Gly 
580 585 590 

Cys He Ser Arg Glu Glu Met Val Ser Tyr Phe Leu Arg Ser Ser Ser 
595 600 605 

Val Leu Gly Gly Arg Met Gly Phe Val His Asn Phe Gin Glu Ser Asn 
610 615 $20 

Ser Leu Arg Pro Val Ala Cys Arg His Cys Lys Ala Leu He Leu Gly 
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625 630 ' 635 640 

lie Tyr Lys Gin Gly Leu Lys Cys Arg Ala Cys Gly Val Asn Cys His 
645 650 655 

Lys Gin Cys Lys Asp Arg Leu Ser Val Glu Cys Arg Arg Arg Ala Gin 
660 665 670 

Ser Val Ser Leu Glu Gly Ser Ala Pro Ser Pro Ser Pro Met His Ser 
675 680 685 

His His His Arg Ala Phe Ser Phe Ser Leu Pro Arg Pro Gly Arg Arg 
690 695 700 

Gly Ser. Arg Pro Pro Glu lie Arg Glu Glu Glu Val Gin Thr Val Glu 
705 710 715 720 

Asp Gly Val Phe Asp lie His Leu 
725 



(2 J INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 09 base pairs 

(B) TYPE; nucleic acid 

( C ) STRAND ED NESS : s ingl e 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: DNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 254 . . *0-83' 2<>$> 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 



CGATTTCATT CCTCGCTCCC CACAGGTCCC TCTCCCCAAA ATATTCCCAT CTTGTCCTAG 60 

CCCATCCCCC AGACTATCTC AAGGACCAGC TGTCCCCACG CCCCCGACCT CCACTAGGCC 120 

TGTGCCACCC GCTGCCTGCA GGAAGACGCC CGGTCCCGGG CCGGGTTAGC CCCATGGGAA 180 

CGGGGTTCGG TCCGAGCCCG GTGGGAGGCT CCCGGAGCGC AGCCTGGGCC CAGCCCACCC 240 

CGCGCCGGCG GCC ATG GCA GGC ACC CTG GAC CTG GAC AAG GGC TGC ACG 2 89 

Met Ala Gly Thr Leu Asp Leu Asp Lys Gly Cys Thr 
1 5 10 

GTG GAG GAG CTG CTC CGC GGG TGC ATC GAA GCC TTC GAT GAC TCC GGG 337 
Val Glu Glu Leu Leu Arg Gly Cys lie Glu Ala Phe Asp Asp Ser Gly 
15 20 25 

AAG GTG CGG GAC CCG CAG CTG GTG CGC ATG TTC CTC ATG ATG CAC CCC 3 85 

Lys Val Arg Asp Pro Gin Leu Val Arg Met Phe Leu Met Met His Pro 
30 .35 40 

TGG TAC ATC CCC TCC TCT CAG CTG GCG GCC AAG CTG CTC CAC ATC TAC 43 3 

Trp Tyr lie Pro Ser Ser Gin Leu Ala Ala Lys Leu Leu His lie Tvr 
45 50 55 60 

CAA CAA TCC CGG AAG GAC AAC TCC AAT TCC CTG CAG GTG AAA ACG TGC 4 81 

Gin Gin Ser Arg Lys Asp Asn Ser Asn Ser Leu Gin Val Lys Thr Cys 
65 .70 75 

CAC CTG GTC AGG TAC TGG ATC TCC GCC TTC CCA GCG GAG TTT GAC TTG 52 9 
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His Leu Val Arg Tyr Trp lie Ser Ala Phe Pro Ala Glu Phe Asp Leu 
80 85 90 

AAC CCG GAG TTG GCT GAG CAG ATC AAG GAG CTG AAG GCT CTG CTA GAC 577 
Asn Pro Glu Leu Ala Glu Gin lie Lys Glu Leu Lys Ala Leu Leu Asp 
95 100 105 

CAA GAA GGG AAC CGA CGG CAC AGC AGC CTA ATC GAC ATA GAC AGC GTC 62 5 

Gin Glu Gly Asn Arg Arg His Ser Ser Leu He Asp He Asp Ser Val 
110 115 120 

CCT ACC TAG AAG TGG AAG CGG CAG GTG ACT CAG CGG AAC CCT GTG GGA 673 
Pro Thr Tyr Lys Trp Lys Arg Gin Val Thr Gin Arg Asn Pro Val Gly 
125 130 135 140 

CAG AAA AAG CGC AAG ATG TCC CTG TTG TTT GAC CAC CTG GAG CCC ATG 721 
Gin Lys Lys Arg Lys Met Ser Leu Leu Phe Asp His Leu Glu Pro Met 

145 150 155 , 

GAG CTG GCG GAG CAT CTC ACC TAC TTG GAG TAT CGC TCC TTC TGC AAG 7 69 

Glu Leu Ala Glu His Leu Thr Tyr Leu Glu Tyr Arg Ser Phe Cys Lys 
160 165 170 

ATC CTG TTT CAG GAC TAT CAC AGT TTC GTG ACT CAT GGC TGC ACT GTG 817 
He Leu Phe Gin Asp Tyr His Ser Phe Val Thr His Gly Cys Thr Val 
175 180 185 

GAC AAC CCC GTC CTG GAG CGG TTC ATC TCC CTC TTC AAC AGC GTC TCA 865 
Asp Asn Pro Val Leu Glu Arg Phe He Ser Leu Phe Asn Ser Val Ser 
190 195 200 

CAG TGG GTG CAG CTC ATG ATC CTC AGC AAA CCC ACA GCC CCG CAG CGG 913 
Gin Trp Val Gin Leu Met He Leu Ser Lys Pro Thr Ala Pro Gin Arg 
205 210 215 220 

GCC CTG GTC ATC ACA CAC TTT GTC CAC GTG GCG GAG AAG CTG CTA CAG 961 
Ala Leu Val lie Thr His Phe Val His Val Ala Glu Lys Leu Leu Gin 
225 230 235 

CTG CAG AAC TTC AAC ACG CTG ATG GCA GTG GTC GGG GGC CTG AGC CAC 1009 
Leu Gin Asn Phe Asn Thr Leu Met Ala Val Val Gly Gly Leu Ser His 
240 245 250 

AGC TCC ATC TCC CGC CTC AAG GAG ACC CAC AGC CAC GTT AGC CCT GAG 1057 
Ser Ser He Ser Arg Leu Lys Glu Thr His Ser His Val Ser Pro Glu 
255 260 265 

ACC ATC AAG CTC TGG GAG GGT CTC ACG GAA CTA GTG ACG GCG ACA GGC 1105 
Thr He Lys Leu Trp Glu Gly Leu Thr Glu Leu Val Thr Ala Thr Glv 
270 275 280 

AAC TAT GGC AAC TAC CGG CGT CGG CTG GCA GCC TGT GTG GGC TTC CGC 115 1 

Asn Tyr Gly Asn Tyr Arg Arg Arg Leu Ala Ala Cys Val Gly Phe Aro 
285 290 295 300 

TTC CCG ATC CTG GGT GTG CAC CTC AAG GAC CTG GTG GCC CTG CAG CTG 12 01 

Phe Pro He Leu Gly Val His Leu Lys Asp Leu Val Ala Leu Gin Leu 
305 310 315 

GCA CTG CCT GAC TGG CTG GAC CCA GCC CGG ACC CGG CTC AAC GGG GCC 12 4 9 

Ala Leu- Pro Asp Trp Leu Asp Pro Ala Arg Thr Arg Leu Asn Gly Ala 
320 325 330 

AAG ATG AAG CAG CTC TTT AGC ATC CTG GAG GAG CTG GCC ATG GTG ACC 12 97 

Lys Met Lys Gin Leu Phe Ser He Leu Glu Glu Leu Ala Met Val Thr 
335 340 345 
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AGC CTG CGG CCA CCA GTA CAG GCC AAC CCC GAC CTG CTG AGC CTG CTC 1345 
Ser Leu Arg Pro Pro Val Gin Ala Asn Pro Asp Leu Leu Ser Leu Leu 
350 ' 355 360 

ACG GTG TCT CTG GAT CAG TAT CAG ACG GAG GAT GAG CTG TAC CAG CTG 1393 
Thr Val Ser Leu Asp Gin Tyr Gin Thr Glu Asp Glu Leu Tyr Gin Leu 
365 370 375 380 

TCC CTG CAG CGG GAG CCG CGC TCC AAG TCC TCG CCA ACC AGC CCC ACG 1441 
Ser Leu Gin Arg Glu Pro Arg Ser Lys Ser Ser Pro Thr Ser Pro Thr 
385 390 395 

AGT TGC ACC CCA CCA CCC CGG CCC CCG GTA CTG GAG GAG TGG ACC TCG 1489 
Ser Cys Thr Pro Pro Pro Arg Pro Pro Val Leu Glu Glu Trp Thr Ser 
400 405 410 

GCT GCC AAA CCC AAG CTG GAT CAG GCC CTC GTG GTG GAG CAC ATC GAG 1537 
Ala Ala Lys Pro Lys Leu Asp Gin Ala Leu Val Val Glu His lie Glu 
415 420 425 

AAG ATG GTG GAG TCT GTG TTC CGG AAC TTT GAC GTC GAT GGG GAT GGC 1585 
Lys Met Val Glu Ser Val Phe Arg Asn Phe Asp Val Asp Gly Asp Gly 
430 435 440 

CAC ATC TCA CAG GAA GAA TTC CAG ATC ATC CGT GGG AAC TTC CCT TAC 1633 
His lie Ser Gin Glu Glu Phe Gin He He Arg Gly Asn Phe Pro Tyr 
445 450 455 460 

CTC AGC GCC TTT GGG GAC CTC GAC CAG AAC CAG GAT GGC TGC ATC AGC 1681 
Leu Ser Ala Phe Gly Asp Leu Asp Gin Asn Gin Asp Gly Cys lie Ser 
465 470 475 

AGG GAG GAG ATG GTT TCC TAT TTC CTG CGC TCC AGC TCT GTG TTG GGG 1729 
Arg Glu Glu Met Val Ser Tyr Phe Leu Arg Ser Ser Ser Val Leu Gly 
480 485 490 

GGG CGC ATG GGC TTC GTA CAC AAC TTC CAG GAG AGC AAC TCC TTG CGC 1777 
Gly Arg Met Gly Phe Val His Asn Phe Gin Glu Ser Asn Ser Leu Arg 
495 .500 505 

CCC GTC GCC TGC CGC CAC TGC AAA GCC CTG ATC CTG GGC ATC TAC AAG 182 5 

Pro Val Ala Cys Arg His Cys Lys Ala Leu He Leu Gly He Tyr Lys 
510 515 520 

CAG GGC CTC AAA TGC CGA GCC TGT GGA GTG AAC TGC CAC AAG CAG TGC 1873 
Gin Gly Leu Lys Cys Arg Ala Cys Gly Val Asn Cys His Lys Gin Cys 
525 530 535 540 

AAG GAT CGC CTG TCA GTT GAG TGT CGG CGC AGG GCC CAG AGT GTG AGC 1921 
Lys Asp Arg Leu Ser Val Glu Cys Arg Arg Arg Ala Gin Ser Val Ser 
545 550 555 

CTG GAG GGG TCT GCA CCC TCA CCC TCA CCC ATG CAC AGC CAC CAT CAC 1969 
Leu Glu Gly Ser Ala Pro Ser Pro Ser Pro Met His Ser His His His 
560 565 570 

CGC GCC TTC AGC TTC TCT CTG CCC CGC CCT GGC AGG CGA GGC TCC AGG 2 017 

Arg Ala Phe Ser Phe Ser Leu Pro Arg Pro Gly Arg Arg Gly Ser Arg 
575 580 585 

CCT CCA GAG ATC CGT GAG GAG GAG GTA CAG ACG GTG GAG GAT GGG GTG 2 0 65 

Pro Pro Glu He Arg Glu Glu Glu Val Gin Thr Val Glu Asp Gly Val 
590 595 600 

TTT GAC ATC CAC TTG TAATAGATGC TGTGGTTGGA TCAAGGACTC ATTCCTGCCT 212 0 

Phe Asp He His Leu 
605 610 
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TGGAGAAAAT ACTTCAACCA GAGCAGGGAG CCTGGGGGTG TCGGGGCAGG AGGCTGGGGA 2180 

TGGGGGTGGG ATATGAGGGT GGCATGCAGC TGAGGGCAGG GCCAGGGCTG GTGTCCCTAA 224 0 

GGTTGTACAG ACTCTTGTGA ATATTTGTAT TTTCCAGATG GAATAAAAAG GCCCGTGTAA 2300 

TTAACCTTC 2309 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 609 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 

Met Ala Gly Thr Leu Asp Leu Asp Lys Gly Cys Thr Val Glu Glu Leu 
1 5 10 15 

Leu Arg Gly Cys He Glu Ala Phe Asp Asp Ser Gly Lys Val Arg Asp 
20 25 30. 

Pro Gin Leu Val Arg Met Phe Leu Met Met His Pro Trp Tyr He Pro 
35 40 45 

Ser Ser Gin Leu Ala Ala Lys Leu Leu His He Tyr Gin Gin Ser Arg 
50 55 60 

Lys Asp Asn Ser Asn Ser Leu Gin Val Lys Thr Cys His Leu Val Arg 
65 .70 75 80 

Tyr Trp He Ser Ala Phe Pro Ala Glu Phe Asp Leu Asn Pro Glu Leu 

,85 90 . 95 

Ala Glu Gin He Lys Glu Leu Lys Ala Leu Leu Asp Gin Glu Gly Asn 
100 105 110 

Arg Arg His Ser Ser Leu lie Asp He Asp Ser Val Pro Thr Tyr Lys 
115 120 125 

Trp Lys Arg Gin Val Thr Gin Arg Asn Pro Val Gly Gin Lys Lys Arg 
130 135 140 

Lys Met Ser Leu Leu Phe Asp His Leu Glu Pro Met Glu Leu Ala Glu 
145 150 155 160 

His Leu Thr Tyr Leu Glu Tyr Arg Ser Phe Cys Lys He Leu Phe Gin 
165 170 175 

Asp Tyr His Ser Phe Val Thr His Gly Cys Thr Val Asp Asn Pro Val 
180 185 , 190 

Leu Glu Arg Phe He Ser Leu Phe Asn Ser Val Ser Gin Trp Val Gin 
195 200 205 

Leu Met He Leu Ser Lys Pro Thr Ala Pro Gin Arg Ala Leu Val He 
210 215 220 

Thr His Phe Val His Val Ala Glu Lys Leu Leu Gin Leu Gin Asn Phe 
225 230 235 240 

Asn Thr Leu Met Ala Val Val Gly Gly Leu Ser His Ser Ser He Ser 
245 250 255 
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Arg Leu Lys Glu Thr His Ser His Val Ser Pro Glu Thr lie Lys Leu 
260 265 

Trp Glu Gly Leu Thr Glu Leu Val Thr Ala Thr Gly Asn Tyr Gly Asn 
275 280 285 

Tyr Arg Arg Arg Leu Ala Ala Cys Val Gly Phe Arg Phe Pro lie Leu 
290 295 300 

Gly Val His Leu Lys Asp Leu Val Ala Leu Gin Leu Ala Leu Pro Asp 
305 310 315 320 

Trp Leu Asp Pro Ala Arg Thr Arg Leu Asn Gly Ala Lys Met Lys Gin 
325 330 335 

Leu Phe Ser lie Leu Glu Glu Leu Ala Met Val Thr Ser Leu Arg Pro 
340 345 350 

Pro Val Gin Ala Asn Pro Asp Leu Leu Ser Leu Leu Thr Val Ser Leu 
355 360 365 

Asp Gin Tyr Gin Thr Glu Asp Glu Leu Tyr Gin Leu Ser Leu Gin Arg 
370 375 380 

Glu Pro Arg Ser Lys Ser Ser Pro Thr Ser Pro Thr Ser Cys Thr Pro 
385 390 395 .400 

Pro Pro Arg Pro Pro Val Leu Glu Glu Trp Thr Ser Ala Ala Lys Pro 
405 410 415 

Lys Leu Asp Gin Ala Leu Val Val Glu His He Glu Lys Met Val Glu 
420 425 430 

Ser Val Phe Arg Asn Phe Asp Val Asp Gly Asp Gly His He Ser Gin 
435 440 445 

Glu Glu Phe Gin He He Arg Gly Asn Phe Pro Tyr Leu Ser Ala Phe 
450 455 460 

Gly Asp Leu Asp Gin Asn Gin Asp Gly Cys He Ser Arg Glu Glu Met 
465 470 475 480 

Val Ser Tyr Phe Leu Arg Ser Ser Ser Val Leu Gly Gly Arg Met Gly 
485 490 495 

Phe Val His Asn Phe Gin Glu Ser Asn Ser Leu Arg Pro Val Ala Cys 
500 505 510 

Arg His Cys Lys Ala Leu He Leu Gly He Tyr Lys Gin Gly Leu Lys 
515 520 525 

Cys Arq Ala Cys Gly Val Asn Cys His Lys Gin Cys Lys Asp Arg Leu 
530 535 540 

Ser Val Glu Cys Arg Arg Arg Ala Gin Ser Val Ser Leu Glu Gly Ser 
545 550 555 560 

Ala Pro Ser Pro Ser Pro Met His Ser His His His Arg Ala Phe Ser 
565 570 575 

Phe Ser Leu Pro Arg Pro Gly Arg Arg Gly Ser Arg Pro Pro Glu He 
580 585 590 

Arg Glu Glu Glu Val Gin Thr Val Glu Asp Gly Val Phe Asp He His 
595 600 605 

Leu 
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(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 832 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 11.. 73 3 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 

GCCCGCCGCC ATG CCG CCC TTA CTG CCC CTG CGC CTG TGC CGG CTG TGG 49 
Met Pro Pro Leu Leu Pro Leu Arg Leu Cys Arg Leu Trp 
1 5 10 

CCC CGC AAC CCT CCC TCC CGG CTC CTC GGA GCG GCC GCC GGG CAG CGG 97 
Pro Arg Asn Pro Pro Ser Arg Leu Leu Gly Ala Ala Ala Gly Gin Arg 
15 20 25 

TCC AGA CCC AGT ACT TAT TAT GAA CTG TTG GGG GTG CAT CCT GGT GCC 145 
Ser Arg Pro Ser Thr Tyr Tyr Glu Leu Leu Gly Val His Pro Gly Ala 
30 35 40 45 

AGC ACT GAG GAA GTT AAA CGA GCT TTC TTC TCC AAG TCC AAA GAG CTG 193 
Ser Thr Glu Glu Val Lys Arg Ala Phe Phe Ser Lys Ser Lys Glu Leu 
50 55 60 

CAC CCA GAC CGG GAC CCT GGG AAC CCA AGC CTG CAC AGC CGC TTT GTG 241 
His Pro Asp Arg Asp Pro Gly Asn Pro Ser Leu His Ser Arg Phe Val 
,65 70 75 

GAG CTG AGC GAG GCA TAC CGT GTG CTC AGC CGT GAG CAG AGC CGC CGC 2 89 

Glu Leu Ser Glu Ala Tyr Arg Val Leu Ser Arg Glu Gin Ser Arg Arg 
80 85 90 

AGC TAT GAT GAC CAG CTC CGC TCA GGT AGT CCC CCA AAG TCT CCA CGA 337 
Ser Tyr Asp Asp Gin Leu Arg Ser Gly Ser Pro Pro Lys Ser Pro Arg 
95 100 105 

ACC ACA GTC CAT GAC AAG TCT GCC CAC CAA ACA CAC AGC TCC TGG ACA 3 85 

Thr Thr Val His Asp Lys Ser Ala His Gin Thr His Ser Ser Trp Thr 
110 115 120 125 

CCC CCC AAC GCA CAG TAC TGG TCC CAG TTT CAC AGC GTG AGG CCA CAG 433 
Pro Pro Asn Ala Gin Tyr Trp Ser Gin Phe His Ser Val Arg Pro Gin 
130 135 140 

GGG CCC CAG TTG AGG CAG CAG CAA CAC AAA CAA AAC AAA CAA GTG CTG 4 81 

Gly Pro Gin Leu Arg Gin Gin Gin His Lys Gin Asn Lys Gin Val Leu 
145 150 155 

GGG TAC TGC CTC CTC CTC ATG CTG GCG GGC ATG GGC CTG CAC TAC ATT 529 
Gly Tyr Cys Leu Leu Leu Met Leu Ala Gly Met Gly Leu His Tyr lie 
160 165 170 

GCC TTC AGG AAG GTG AAG CAG ATG CAC CTT AAC TTC ATG GAT GAA AAG 57 7 

Ala Phe Arg Lys Val Lys Gin Met His Leu Asn Phe Met Asp Glu Lys 
175 180 185 
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GAT CGG ATC ATC ACA GCC TTC TAC AAC GAA GCC CGG GCA CGG GCC AGG 625 
Asp Arg lie lie Thr Ala Phe Tyr Asn Glu Ala Arg Ala Arg Ala Arg 
190 195 200 205 

GCC AAC AGA GGC ATC CTT CAG CAG GAG CGA CAA CGG CTA GGG CAG CGG 673 
Ala Asn Arg Gly lie Leu Gin Gin Glu Arg Gin Arg Leu Gly Gin Arg 
210 215 220 

CAG CCG CCA CCA TCC GAG CCA ACC CAA GGC CCC GAG ATC GTG CCC CGG 721 
Gin Pro Pro Pro Ser Glu Pro Thr Gin Gly Pro Glu lie Val Pro Arg 
225 230 235 

GGC GCC GGC CCC TGA GGGGCTC ACCTGGATGG GGCCTGCAGT GCGTTCCCGC 773 
Gly Ala Gly Pro * 
240 

TTTGCTTCCT TCCCTGGACG GCCCGCTCCC CGAAACGCGC GCAATAAAGT GATTCGCAG 832 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 241 amino acids 

(B) TYPE: amino acid 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE; protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

Met Pro Pro Leu Leu Pro Leu Arg Leu Cys Arg Leu Trp Pro Arg Asn 
1 5 10 15 

Pro Pro Ser Arg Leu Leu Gly Ala Ala Ala Gly Gin Arg Ser Arg Pro 
20 25 30 

Ser Thr Tyr Tyr Glu Leu Leu Gly Val His Pro Gly Ala Ser Thr Glu 
35 40 45 

Glu Val Lys Arg Ala Phe Phe Ser Lys Ser Lys Glu Leu His Pro Asp 
50 55 60 

Arg Asp Pro Gly Asn Pro Ser Leu His Ser Arg Phe Val Glu Leu Ser 
65 70 75 80 

Glu Ala Tyr Arg Val Leu Ser Arg Glu Gin Ser Arg Arg Ser Tyr Asp 
85 90 95 

Asp Gin Leu Arg Ser Gly Ser Pro Pro Lys Ser Pro Arg Thr Thr Val 
100 105 110 

His Asp Lys Ser Ala His Gin Thr His Ser Ser Trp Thr Pro Pro Asn 
115 120 125 

Ala Gin Tyr Trp Ser Gin Phe His Ser Val Arg Pro Gin Gly Pro Gin 
130 135 140 

Leu Arg Gin Gin Gin His Lys Gin Asn Lys Gin Val Leu Gly Tyr Cys 
145 150 155 160 

Leu Leu Leu Met Leu Ala Gly Met Gly Leu His Tyr He Ala Phe Arg 
165 170 175 

Lys Val Lys Gin Met His Leu Asn Phe Met Asp Glu Lys Asp Arg He 
180 185 190 

He Thr Ala Phe Tyr Asn Glu Ala Arg Ala Arg Ala Arg Ala Asn Arg 
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195 



200 



205 



Gly He Leu Gin Gin Glu Arg Gin Arg Leu Gly Gin Arg Gin Pro Pro 
210 215 220 

Pro Ser Glu Pro Thr Gin Gly Pro Glu He Val Pro Arg Gly Ala Glv 
225 230 235 240 

Pro 



(2) INFORMATION FOR SEQ ID NO: 7; 



(i) SEQUENCE CHARACTERISTICS: 

(aKleNGTH: 300 base pairs 

(B) ^TPE; nucleic acid 

(C) StUJANDEDNESS : single 

(D) TOPOLOGY : linear 



(ii) MOLECULE 



DNA 



(ix) FEATURE: 

(A) NAME/ KEY: 

(B) LOCATION: 111 



.300 



(xi) SEQUENCE DESCRIPTION :^SEQ ID NO: 7: 
CGATTTCATT CCTCGCTCCC CACAGGTCCC TCTCCCCj 
CCCATCCCCC AGACTATCTC AAGGACCAGC TGTQCC2CACG 
TGTGCCACCC GCTGCCTGCA GGAAGACGCC CGG^C^GGG 



GGG AAC GGG GTT CGG TCC GAG COf GGT GGG AGG 
Gly Asn Gly Val Arg Ser Glu Bro Gly Gly Arg 
5 SlQ 

CTG GGC CCA GCC CAC CCC QCG CCG GCG GCC ATG 
Leu Gly Pro Ala His Pro/Ala Pro Ala Ala Met 
20 / 25 

CTG GAC AAG GGC TGc/acG GTG GAG GAG CT 
Leu Asp Lys Gly cfs Thr Val Glu Glu Leu 
35 / 40 



ATATTCCCAT CTTGTCCTAG 

CCCCCGACCT CCACTAGGCC 

CCGGGTTAG CCC CAT 
Pro His 
1 

$C CCG GAG CGC AGC 
Le\Pro Glu Arg Ser 
J.5 

GCA GG^ACC CTG GAC 
Ala Gly\hr Leu Asp 
30 



223 



271 



300 



(2) INFOHl«(TION FOR SEQ ID NO : 8 : 

.) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 
{ B) TYPE; amino acid 
(D) TOPOLOGY: linear 



WO 98/53061 



-75- 



PCI7AU9S/00380 



Pro 
1 



i^s Gly.Asn Gly Val Arg Ser Glu Pro Gly Gly Arg Leu Pro Gli 



10 



15 



Arg Ser Leu 



Pro Ala His Pro Ala Pro Ala Ala Met Ala Thr 
25 



Leu Asp Leu Asp Lys 
35 



Cys Thr Val Glu Glu Leu, 
40 



(2) INFORMATION FOR SEQ ID NO: 9: 



(i) SEQUENCE CHARACTERISTIC 

(A) LENGTH: 15 has 

(B) TYPE: nucleip^acid 

(C) STRANDEDNESS: single 

(D) TOPOLOC^T linear 



(ii) MOLECULE 



fPE: DNA 



(xLT SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



(2) INFORMATION FOR SEQ ID NO: 10: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xij SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Asp Val Asp Glu Glu Asp Glu Val Glu Asp lie Glu Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; Peptide 



(xi) .SEQUENCE DESCRIPTION: SEQ ID NO:ll: 

Asp Val Asp Gly Asp Gly His lie Ser Gin Glu Glu Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 13 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ 10 NO: 12: 

Asp His Asp Arg Asp Gly Phe lie Ser Gin Glu Glu Phe 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp Gin Asn Gin Asp Gly Cys lie Ser Arg Glu Glu Met 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l«: 

Asp Val Asp Met Asp Gly Gin lie Ser Lys Asp Glu Leu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

His Phe Val His Val Ala Glu Lys Leu Leu Gin Leu Gin Asn Phe Asn 
1 5 10 15 

Thr Leu Met Ala Val Val Gly Gly Leu Ser His Ser Ser He Ser Arg 
20 25 30 
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Leu Lys Glu Thr His 
35 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 37 amino acids 

(B) TYPE: amino acid 

( C ) S TRANDEDNESS : s ing 1 e 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

Lys Phe Val His Val Ala Lys His Leu Arg Lys lie Asn Asn Phe Asn 
1 5 10 15 

Thr Leu Met Ser- Val Val Gly Gly He Thr His Ser Ser Val Ala Arg 
20 25 30 

Leu Ala Lys Thr Tyr 
35 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 
(5) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

His Asn Phe Gin Glu Ser Asn Ser Leu Arg Pro Val Ala Cys Arg His 
15 10 15 

Cys Lys Ala Leu He Leu Gly He Tyr Lys Gin Gly Leu Lys Cys Arg 
20 25 30 

Ala Cys Gly Val Asn Cys His Lys Gin Cys Lys Asp Arg Leu Ser Val 
35 40 . 45 

Glu Cys 
50 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

(ii) MOLECULE TYPE: Peptide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

His Asn Phe His Glu Thr Thr Phe Leu Thr Pro Thr Thr Cys Asn His 
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15 10 15 

Cys Asn Lys Leu Leu Trp Gly He Leu Arg Gin Gly Phe Lys Cys Lys 
20 25 30 

Asp Cys Gly Leu Ala Val His Ser Cys Cys Lys Ser Asn Ala.Val Ala 
35 40 45 

Glu Cys 
50 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGGATCCCCC TGGTC 15 
(2) INFORMATION FOR. SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
GAATTCGGCA CGAGCCGACG G 21 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 78 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
ATGGAGCAGA AGCTGATCTC CGAGGAGGAC CTGCCCGGGG CAGCTGGATC CGCAGCCCAC 60 
CCCGCGCCGG CGGCCATG 78 
(2) INFORMATION FOR SEQ ID NO: 22: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

Met Glu Gin Lys Leu lie Ser Glu Glu Asp Leu Pro Gly Ala Ala Gly 
1 5 10 15 

Ser Ala Ala His Pro Ala Pro Ala Ala Met 
20 25 



(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

. (C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
GGATCCGCAG CCCACCCCGC GCCGGCGGCC ATG 33 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

.(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

Gly Ser Ala Ala His Pro Ala Pro Ala Ala Met 
5 10 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25 
GGACAAAGTG TGTGATGAAC C 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 
CTCATCCTCC GTCTGATACT G 
(2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE; DNA 



(xij SEQUENCE DESCRIPTION: SEQ ID NO:27: 
GTAGATGTGG ATCAGCTTGG 
(2) INFORMATION TOR SEQ ID NO: 28; . 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE; nucleic acid 
CO STRANDEDNESS: single 
( D ) TO POLOGY : 1 i ne a r 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:28: 
AGGTGGAGAA TGGTCAAGG 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 29: 
GTCATAGTCT GTCTCCTACT 
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{2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30 
ACATAGACAG CGTGCCTACC 
(2) INFORMATION FOR SEQ ID NO:3l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TO POLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31 

TAC AACCTTA GGGACACCAG 

(2) INFORMATION FOR SEQ ID NO:32: 

(i)' SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2G base pairs 
, (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TGCTGAGCCT GCTCACGGTG 
(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 33: 
CAAG TGAAC A GCACGTCC 
(2) INFORMATION FOR SEQ ID NOr34: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) * MOLECULE TYPE: DMA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34 
GACTATCTCA AGGACCAGCT G 
(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 
GGTTCGGTCC GAGCCCGG 



(2) INFORMATION FOR SEQ ID NO: 36: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 
GGAGCGATAC TCCAAGTAGG T 



(2) INFORMATION FOR SEQ ID NO:37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
AGCGGGCCAG GCCCCTTC 



(2) INFORMATION FOR SEQ ID NO: 38: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38 
CATCCTGGTC CAATGCGCTC 



(2) INFORMATION FOR SEQ ID NO: 39: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(CJ STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA . 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 
GCACTGAGGA AGTTAAACGA GC 



(2) INFORMATION FOR SEQ ID NO: 40: 

(i) SEQUENCE CHARACTERISTICS: 
(AJ LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 
GCTCGTTTAA CTTCCTCAGT GC 
(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GCTCAGCTCC ACAAAGCGGC T 

(2) INFORMATION FOR SEQ ID NO: 42: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



ACCAGCTCCG CTCAGGTAG 
(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear . 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43: 



TCCAGGAGCT GTGTGTTTGG 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44; 



CCAGTTTCAC AGCGTGAGG 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 



CAGCATGAGG AGGAGGCAG 
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BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 is a representation of the nucleotide sequence (SEQ ID NO: 2) and 
corresponding amino acid sequence (SEQ ID NO: 3) of mcgA. 

Figure 2 is a representation of the alignment of the human MCG4 amino acid sequence 
(amino acids: 1-60, SEQ ID NO: 46) (Query) with a translation of a partial murine expressed 
sequence tag (EST) (SEQ ID NO: 47) (Subject). 

Figure 3 is a representation of the alignment of the human MCG4 amino acid sequence 
(amino acids 7-109; SEQ ID NO: 48) (Query) with a translation of a partial nematode EST (SEQ 
ID NO: 49) (subject). 

Figure 4 is a diagrammatic representation showing a predicted structure of MCG4 where 
H and C represent histidine and cysteine residues, respectively and X refers to any amino acid 
residue. Zn represents zinc atoms. 

Figure 5 is a representation of sensitive sequence homology search of related cysteine- 
containing motifs in another Caenorhabditis elegans protein. Query: amino acids 56-100 (SEQ 
ID NO: 50). Subject: SEQ ID NO: 51. 

Figure 6 is a representation showing that a related cysteine containing motif is present in 
the GATA-binding transcription factor from Sacchromyces pombe. Queries: amino acids 35-58 
(SEQ ID NO: 52); amino acids 162-227 (SEQ ID NO: 54). Subjects: amino acids 175-198 (SEQ 
ID NO: 53); amino acids 441-506 (SEQ ID NO: 55). 

Figure 7 is a Northern blot showing expression of mcg4 in various cultured human cancer 
cell lines. Lanes 1-5, respectively, represent the hybridization signal from 15 (ig total RNA 
derived from various human cancer cells lines. Lanes 1-5, respectively, contain RNA from H69 
lung carcinoma cells, JAM ovary carcinoma cells, BT20 breast carcinoma cells, HaCat 
transformed keratinocytes, T24 bladder carcinoma cells. 

Figure 8 is a representation of a partial alignment of mcg4 with human ESTs AA074703 
and AA134788. Queries: nucleotides 446-704 (SEQ ID NO: 56); nucleotides 398-452 (SEQ ID 
NO: 58); nucleotides 767-810 (SEQ ID NO: 60); nucleotides 731-765 (SEQ ID NO: 62); 
nucleotides 701-732 (SEQ ID NO: 64); nucleotides 498-687 (SEQ ID NO: 66); nucleotides 398- 
495 (SEQ ID NO: 68); nucleotides 702-761 (SEQ ID NO: 70). Subjects: nucleotides 49-307 
(SEQ ID NO: 57); nucleotides 2-56 (SEQ ID NO: 59); nucleotides 373-416 (SEQ ID NO: 61); 
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nucleotides 336-370 (SEQ ID NO: 63); nucleotides 305-336 (SEQ ID NO: 65); nucleotides 103- 
292 (SEQ ID NO: 67); nucleotides 2-99 (SEQ ID NO: 69); nucleotides 309-368 (SEQ ID NO: 
1). 

Figure 9 is a representative of the partial nucleotide sequence alignment between a 
human (W32939) (SEQ ID NO: 72) and mouse (AA242159) (SEQ ID NO: 73) /wcg4-like EST 
in the putative 5' UTR of the mcgA cDNA. The putative initiation codon is underlined and the 
region upstream represents 5' UTR. 

Figure 10 is a representation showing Mac Vector alignment of MCG4 with forward 
translations of ESTs AA 134788 and AA074703. The nucleotide sequences are shown in Figure 
8. 

Figure 1 1 is a diagrammatic representation of the domains of MCG4 
zinc finger consensus: CX2HX4CX2CX4HX2CXi7CX2CXi 8 HX2CXi8CX 2 C (SEQ ID NO: 74); 
acidic domain consensus: 9/34 amino acids negatively charged, 0/34 positively charged; 
basic domain consensus: 13/55 amino acids positively charged, 0/55 negatively charged; 
leucine zipper domain consensus: LX 6 LX 6 RX6LX6L (SEQ ID NO: 75); 
alternate "novel" leucine zipper-like motif where leucine would not be aligned along the one 
surface to an alpha helix domain: (aa261) LX 6 LXLX 6 LXLX 6 L (aa286) (SEQ ID NO: 76). 

Figure 12 is a representation showing similarity of MCG7 WITH GEFs of various 
organisms. 

Figure 13(a) is a representation of the nucleotide sequence (SEQ ID NO: 4) and 
corresponding amino acid sequence (SEQ ID NO: 5) of mcgl. Nucleotides 183-288 are an 
alternative spliced exon (shown in lower case). 

Figure 13(b) is a representation of the partial nucleotide sequence (SEQ ID NO: 6) and 
corresponding amino acid sequence (SEQ ID NO: 7) OF mcgl but without the exon shown in 
Fig. 13(a). Amino acids have been numbered from the first methionine codon (underlined). The 
cDNA molecules of Fig. 13(a) and Fig. 13(b) differ by the inclusion and exclusion of the exon of 
nucleotides 183-288. 

Figure 14 is a representation showing a comparison between MCG7 (SEQ ID NO: 7) and 
a homologue from Caenorhabditis elegans (SEQ ID NO: 77) using the BESTFIT algorithm. In 
the figure, the following sequences are underlined: 



12 
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EF-Hand=PROSITE DATABASE NO. PD0C00018 



la nematode 



DVDEEDEVEDIEF (SEQ ID NO: 10) 
DVDGDGHISQEEF (SEQ ID NO: 1 1) 
DHDRDGFISQEEF (SEQ ID NO: 12) 
DQNQDGCISREEM (SEQ ID NO: 13) 



lb human 



nematode 



lc human 



nematode 



DVDMDGQISKDEL (SEQ ID NO: 14) 



GUANINE NT BINDING REGION = BLOCKS DATABASE NO. BL00720B 

2 human HFVHVAEKLLQLQNFNTLMAVVGGLSHSSISRLKETH (SEQ ID NO: 1 5) 
nematode KFVHVAKHLRKINNFNTLMSVVGGITHSSVARLAKTY (SEQ ID NO: 16) 

DaG-PE BINDING DOMAIN = PROSITE DATABASE NO. PD0C00379 

3 human HNFQESNSLRPVACRHCKALILGIYKQGLKCRACGVNCHKQCKDRLSVEC 

(SEQ ID NO: 17) 

nematode HNFHETTFLTPTTCNHCNKLLWGILRQGFKCKDCGLAVHSCCKSNAVAEC 
(SEQ ID NO: 18) 

Figure 15 is a representation of an alignment of human and a partial (5' UTR and partial 
coding sequence) murine mcgl cDNA (GenBank Acc.No. W71787 and AA237373). The 
putative initiation codon is underlined. The murine sequence (SEQ ID NO: 78) represents a 
composite of 2 partial cDNA sequences from the EST database (accession numbers W71787 and 
AA237373). Nucleotide differences between human and murine sequences are shown in lower 
case lettering and identical residues are indicated with asterisks. 

Figure 16 is a representation of further 5' nucleotide (SEQ ID NO: 79) and corresponding 
amino acid sequence (SEQ ID NO: 80) human mcgl. Nucleotide positions 1-321 were derived 
from GenBank Acc. No. AC000134 and nucleotides 322 onwards from Figure 13(a). Two in- 
frame initiation codons are underlined. Asterisks denote in-frame stop codons. 

Figure 17 is a graphical representation of a GDP release assay. □Experiment #1 (mean 
of duplicate). OExperiment #2 (mean of duplicates). The exchange reaction contained 36pmols 
of GST-MCG (N-terminally truncated; encoded by Construct B in Fig. 18) and 1.6-12.8 pmols of 
recombinant GST-N-Ras.GDP. Reaction time 6 min. 
Estimated reaction constants: 
K m = 2.1fiM, V max = 37 pMol/6min/36pMol [ExptJl] 
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K m = 1.5uM, V max = 30.3pMol/6min/36pMol[Expt.#2] 

Figure 1 8 depicts various recombinant plasmids containing partial or full-length mcg7. 

Figure 19 is a representation of the nucleotide sequence (SEQ ID NO: 8) and 
corresponding amino acid sequence (SEQ ID NO: 9) of meg] 8. 

Figure 20 is representation showing that MCG18 (amino acids 35-96, SEQ ID NO: 81] 
(Query) has partial homology to E. coli DnaJ (amino acids 6-65, SEQ ID NO: 82) (Subject). 

Figure 21 is a representation showing that MCG18 has homotology to two Caenorhabitis 
elegans proteins. Queries: amino acids 28-64 (SEQ ID NO: 83); amino acids 71-102 (SEQ ID 
NO: 85); amino acids 162-203 (SEQ ID NO: 87); amino acids 35-75 (SEQ ID NO: 89); amino 
acids 75-96 (SEQ ID NO:91); amino acids 141-184 (SEQ ID NO: 93). Subjects: amino acids 
22-58 (SEQ ID NO:84); amino acids 64-95 (SEQ ID NO: 86); amino acids 158-199 (SEQ ID 
NO: 88); amino acids 19-50 (SEQ ID NO: 90); amino acids 71-92 (SEQ ID NO: 92); amino 
acids 145-188 (SEQ ID NO: 94). 

Figure 22 is a representation showing that MCG18 has homology to a Saccharomyces 
pombe protein. Queries: amino acids 35-70 (SEQ ID NO: 95); amino acids 75-1 14 (SEQ ID 
NO: 97); amino acids 190-218 (SEQ ID NO: 99); amino acids 140-158 (SEQ ID NO: 101). 
Subjects: amino acids 9-44 (SEQ ID NO: 96); amino acids 50-89 (SEQ ID NO: 98); amino acids 
149-177 (SEQ ID NO:100); amino acids 44-62 (SEQ ID NO: 102). 

Figure 23 is a representation showing homology of MCG18 to a Drosophila virilis 
protein. Queries: amino acids 26-95 (SEQ ID NO: 103). Subject: amino acids 72-142 (SEQ ID 
NO: 104). 

Figure 24 is a representation showing homology of MCG18 to human DnaJ protein HDJ- 
2/HSDJ (SEQ ID NO: 105), HDJ-1/HSP40 (SEQ ID NO: 106) and HSJ1 (SEQ ID NO: 107). 

Figure 25 is a representation of the nucleotide (SEQ ID NO: 108) and corresponding 
amino acid sequence (SEQ ID NO: 109) of murine mcgl8. 

Figure 26 is a representation of homology between human and murine MCG18 (SEQ ID 
NO: 9 and SEQ ID NO: 109). 

Figure 27 depicts nucleotide sequence (SEQ ID NO: 1 10) corresponding to the 5' 
untranslated region of human mcg!8. 
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Figure 28 depicts a Northern blot showing expression of meg 18 transcripts in total RNA 
isolated from various human cancer cell lines grown in culture. Lanes 1-5 respectively contain 
15|ng RNA from H69 lung carcinoma cells, JAM ovary carcinoma cells, BT20 breast carcinoma 
cells, HaCat transformed keratinocytes, T24 bladder carcinoma cells. 



